Neural TTS, Voice Cloning, Real-time Audio, Kitten TTS
HAVE: Head-Adaptive Gating and ValuE Calibration for Hallucination Mitigation in Large Language Models
arxiv.org·1h
Something to critique
onebadbit.com·3h
TSPC: A Two-Stage Phoneme-Centric Architecture for code-switching Vietnamese-English Speech Recognition
arxiv.org·1h
Serialized Output Prompting for Large Language Model-based Multi-Talker Speech Recognition
arxiv.org·1d
LibriQuote: A Speech Dataset of Fictional Character Utterances for Expressive Zero-Shot Speech Synthesis
arxiv.org·4d
Loading...Loading more...